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HUMAN T CELL RESPONSE TO MHC-BINDIlPr<|} MCJfat i:LTlSmRS:|i 

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH 
This invention was made with United States Government support from the National 
Institutes of Health (NTH Grant No. R01-AI3571). The Government may have certain rights in 
the invention. 

TECHNICAL FIELD OF THE INVENTION 
This invention relates generally to vaccines and to computer-based algorithms used to 
predict epitopes. 

BACKGROUND OF THE INVENTION 

The reemergence of tuberculosis as a public health issue, particularly Mycobacterivm 
tuberculosis (Mtb) superinfection of Human Immunodeficiency Virus (HIV)- infected individuals, 
has prompted the need for improvements in vaccination. Recognition of and response to 
Mycobacterium tuberculosis protein antigens by CD4+ T cells requires the intracellular 
processing of these antigens, and the subsequent presentation of the derived peptides by class n 
major histocompatability complex (MHC) molecules at the surface of antigen presenting cells 
(APC). To identify these T-cell epitopes, the standard approach has been to synthesize 
overlapping peptides spanning the entire sequence of a given protein antigen. These peptides are 
then tested for their capacity to stimulate T cell proliferative responses in vitro^ using cells firom 
Mtb immune individuals. Although this overlapping peptide method is thorough, it is both cost- 
and labor-intensive. 

The interaction between Mtb protein sequences and the molecules of the immune system 
(the human leukocyte antigens, "HLA"), which present peptides derived fi"om the proteins of the 
challenge protein to the immune system and to engage vaccine-trained T cells to respond, can lead 
to variations in immune responses. Due to the tight-fit nature of the interaction between 
Mtb-derived peptides and the HLA, changes in amino acid sequence of a challenge strain may 
interfere with the ability of a given peptide to bind to the HLA molecule, thereby preventing 
recognition of the challenge strain by T cell clones raised against a vaccine construct. 

Sequence modifications at the amino acid level may affect the recognition of the epitope in 
three ways: (1) by affecting intracellular processing, (2) by interfering with binding (of the 
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peptide) to major histocompatibility (such as major histocompatibility complex (MHC) or HLA) 
molecxiles and presentation of the peptide-HLA complex at the antigen presenting-cell surface, 
and (3) by interfering with binding of the epitope to the T cell receptor (TCR) {See Germain & 
Margulies, Ann. Rev. Immunol. 11:403 (1993); Falk etal. Nature 351:290 (1991)). 

Computer-based algorithms have been designed to predict T cell epitopes from the amino 
acid sequences of proteins, and to diminish the cost and labor associated with the identification of 
T cell epitopes by the overlapping peptide method. See DeGroot, et al.. New Generation 
Vaccines, 2""*^ Ed. (1996); Meister, etal., Vaccine 13:581 (1995); Roberts, etaL, ATOS Res. Hu. 
Retrovir. 7:593 (1996); Hammer, etaL, J. Exp. Med. 180:2353 (1994); Davenport, etaL, 
Lnmunogenetics 42:392 (1995); Fleckenstein, etaL, Eur. J. Biochem. 240:71 (1995). One such 
algorithm, EpiMer, predicts putative T cell epitopes by searching an amino acid sequence for 
regions containing clusters of MHC-binding motifs. These "motifs" are defined as recurring amino 
acid patterns found in a large percentage of peptides that bind to specific MHC alleles. 

SUMMARY OF THE INVENTION 
EpiMer is a computer-based algorithm for predicting T-cell epitopes within protein 
antigens by searching for clusters of major histocompatability complex molecule (MHC) binding 
motifs EpiMer was used to identify putative epitopes for ?out Mycobacterium tuberculosis (Mtb) 
antigens, 14 kDa, 16 kDa, 19 kDa, and 32 kDa. A total of 23 putative epitopes were predicted, 
and 28 corresponding peptides were synthesized. Lymphoproliferation assays were conducted 
using these peptides and peripheral blood mononuclear cells from 40 Mtb-immune and 19 Mtb- 
naive subjects recruited from State Tuberculosis Clinic in Providence, RI; the Lemuel Shattuck 
Hospital, Jamaica Plain, MA; and the Research Institute of Tropical Medicine, Manila, the 
Philippines. Of the 28 peptides tested, all were found to induce a proliferative response in at least 
one Mtb immune individual. Predicted epitopes that contained a higher number of MHC-binding 
motifs were more likely to stimulate T cell response in a greater number of Mtb immune 
individuals than those with a lower number of MHC-binding motifs (RR 5.0; 95% confidence 
intervals 1.7 to 14). There was an increased likelihood of having a proliferative response to a 
peptide which contained an MHC-binding motif matched for the subject's allele (RR = 1.5, 95% 
CI 0.9 to 2.5). 

Algorithms such as EpiMer, which search for regions of MHC-binding motif clustering, 
may be useful for the development of subunit vaccines against Mtb. 
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The invention provides Mtb vaccine candidate peptides, including the peptides shown as 
SEQ ID NOS: 1-28. The invention also provides an Mtb vaccine, which is an Mtb peptide in an 
immunologically acceptable excipient, such as any of the vaccine carriers known in the medical 
arts. The invention also provides a method for identifying Mtb vaccine candidates that could be 
presented in the context of more than one HLA. 

The details of one or more embodiments of the invention are set forth in the accompanying 
description. Although any methods and materials similar or equivalent to those described herein 
can be used in the practice or testing of the invention, the preferred methods and materials are 
now described. Other features, objects, and advantages of the invention will be apparent from the 
description and from the claims. In the specification and the appended claims, the singular forms 
include plural referents unless the context clearly dictates otherwise. Unless defined otherwise, all 
technical and scientific terms used herein have the same meaning as commonly understood by one 
of ordinary skill in the art to which this invention belongs. All patents and publications cited in this 
specification are incorporated by reference. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 shows the MHC-binding motif density histograms for the four Mtb protein antigens 
studied, based on predictions by EpiMer ML 1994. The number of MHC-binding motifs is plotted 
against the midpoint of an 11 amino acid reading frame. White bars above the motif density 
histogram indicate peptides tested in other laboratories; epitopes described by these laboratories 
are indicated in black. All bars below the motif density histogram represent peptides synthesized 
to correspond to EpiMer predictions; grey bars indicate peptides which also corresponded to 
published epitopes; black bars indicate when these peptides were also recognized by six or more 
subjects in the study cohort. FIG 1(a) shows the 14 kDa Mtb protein; FIG. 1(b) shows the 16 kDa 
Mtb protein; FIG. 1(c) shows the 19 kDa Mtb protem; and FIG. 1(d) shows the 32 kDa Mtb 
protein. 

FIG. 2 shows the results of lymphoproliferation assays to PPD, TT, PHA, and to peptides 
performed in Providence, BI and Manila, the Philippines. Solid boxes indicate responses of SI ^ 
3 .0, grey boxes indicate SI > 2.0, open boxes indicate SI < 2, N/D indicates the wells with PHA 
that were not done for that subject. In those cases where response differed between the 1 |ig/ml 
and 10 ^ig/ml peptide concentration, the data shown are for the higher response. FIG. 2(a) shows 
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the results of the Mtb-immune group (n=40). FIG. 2(b) shows the results of the Mtb-n^ve 
group (n=19). 

FIG. 3 is a scatterplot demonstratmg the association between the number of motif matches 
contained within a peptide and the number of Mtb-immune subjects who respond to that peptide. 
FIG. 4 is a table containing a "full list" of Mtb peptides. 

DETAILED DESCRIPTION OF THE INVENTION 

EpiMer, and other MHC-binding motif-based algorithms, may be usefiil methods for 
identifying "promiscuous" peptides which can be recognized by a higher number of individuals in 
outbred human populations. The cost and time savings of this method over the traditional 
overlapping approach are substantial, and this method may eventually contribute to the 
development of a novel sub-unit vaccine against Mtb. 

The EpiMer algorithm was applied to four Mtb protein antigens, all of which were 
selected for analysis because they had been previously shown to stimulate proliferative responses 
in Mtb-infected subjects. The purpose of this study was to prospectively confirm the utility of the 
EpiMer algorithm, by (1) measuring the response of Mtb immune subjects to EpiMer-predicted 
peptides containing clusters of MHC-binding motifs, and by (2) measuring individual responses to 
other peptides containing motifs matched to the subjects' HLA-DR allele. 

Vaccines can include any one of the Mtb vaccine candidate peptides disclosed below, 
either alone, in combination with suitable carriers, linked to carrier proteins, or expressed from a 
polynucleotide, such as a "naked DNA" vaccine. The peptides can be administered to a host for 
treatment of Mtb. The peptides can also be used to enhance immunologic function. 

Peptides, The Mtb vaccine candidate peptides can be produced by well known chemical 
procedures, such as solution or solid-phase peptide synthesis, or semi-synthesis in solution 
beginning with protein fragments coupled through conventional solution methods, as described by 
Dugas & Penney, Bioorganic Chemistry^ 54-92 (Springer- Verlag, New York, 1981). For 
example, peptides can be synthesized by solid-phase methodology utilizing a PE- Applied 
Biosystems 43 OA peptide synthesizer (commercially available from Applied Biosystems, Foster 
City, CA) and synthesis cydes supplied by Applied Biosystems. Boc amino acids and other 
reagents are commercially available from PE- Applied Biosystems and other chemical supply 
companies. Sequential Boc chemistry using double couple protocols are appUed to the starting 
p-methyl benzhydryl amine resins for the production of C-terminal carboxamides. After synthesis 
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and cleavage, purification is accomplished by reverse-phase CI 8 chromatography (Vydac) column 
in 0.1% TFA with a gradient of increasing acetonitrile concentration. The solid phase synthesis 
could also be accomplished using the FMOC strategy and a TFA/scavenger cleavage mixture. 

When produced by conventional recombinant means, the Mtb vaccine candidate peptide 
can be isolated either ftom the cellular contents by conventional lysis techniques or from cell 
medium by conventional methods, such as chromatography (see, e.g. , Sambrook et al. Molecular 
Cloning. A Laboratory Manual, 2d Edition (Cold Spring Harbor Laboratory, New York (1989). 

In one embodiment, the Mtb vaccine candidate peptide as a maximtim size of 50 amino 
acids in length and a minimum size of 8 amino acids to 1 1 amino acids (for the relevant SEQ ID 
NOS). The peptide can be any size between the minimum to maximum size, and one Mtb vaccine 
candidate peptide can be of a given size independently of another Mtb vaccine candidate peptide. 
For example, one Mtb vaccine candidate peptide can be 25 amino acids in length while another 
Mtb vaccine candidate peptide is 45 amino acids in length. 

Peptides as antigens. The Mtb vaccine candidate peptides are usefiil as antigens for raising 
anti-Mtb immune responses, such as T cell responses (cytotoxic T cells or T helper cells). An 
"antigen" is a molecule or a portion of a molecule capable of stimulating an immune response, 
which is additionally capable of inducing an animal or human to produce antibody capable of 
binding to an epitope of that antigen. An "epitope" is that portion of any molecule capable of 
being recognized by and bound by an MHC molecule and recognized by a T' cell or bound by an 
antibody. An antigen can have one or more than one epitope. The specific reaction indicates that 
the antigen will react, in a highly selective manner, with its corresponding MHC and T cell, or 
antibody and not with the multitude of other antibodies which can be evoked by other antigens. 

A peptide is "inamunologically reactive" with an T cell or antibody when it binds to an 
MHC and is recognized by a T cell or binds to an antibody due to recognition (or the precise fit) 
of a specific epitope contained within the peptide. Immunological reactivity can be determined by 
measuring T cell response in vitro or by antibody binding, more particularly by the kinetics of 
antibody binding, or by competition in binding using as competitors a known peptides containing 
an epitope against which the antibody or T cell response is directed. The techniques for 
determining whether a peptide is immunologically reactive with a T cell or with an antibody are 
known in the art. The peptides can be screened for eflBcacy by in vitro and in vivo assays. Such 
assays employ inununization of an animal, e.g., a rabbit or a primate, with the peptide, and 
evaluation of titers antibody to Mtb or to synthetic detector peptides corresponding to variant 
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Mtb sequences. Methods of detennining the spatial cojoformation of ammo adds are lalown in the 
art, and include, for example, x-ray crystallography and 2-dimensional nuclear magnetic 
resonance. 

Polynucleotides encoding the peptides. Polynucleotides can encode Mtb vacdne candidate 
peptides, including peptides fused to carrier proteins. Mtb vaccine candidate peptides can be 
encoded by either a synthetic or recombinant polynucleotide. The term "recombinant" refers to 
the molecular biological technology for combining polynucleotides to produce useful biological 
products, and to the polynucleotides and peptides produced by this technology. The 
polynucleotide can be a recombinant construct (such as a vector or plasmid) which contains the 
polynucleotide encoding the Mtb vaccine candidate peptide or fusion protdn under the operative 
control of polynucleotides encoding regulatory elements such as promoters, termination signals, 
and the like, "Operatively linked" refers to a juxtaposition wherein the components so described 
are in a relationship permitting them to function in their intended manner. A control sequence 
operatively linked to a coding sequence is ligated such that expression of the coding sequence is 
achieved under conditions compatible with the control sequences. "Control sequence" refers to 
polynucleotide sequences which are necessary to effect the expression of coding and non-coding 
sequences to which they are ligated. Control sequences generally include promoter, ribosomal 
binding site, and transcription termination sequence. In addition^ "control sequences" refers to 
sequences which control the processing of the peptide encoded within the coding sequence; these 
can include, but are not limited to, sequences controlling secretion, protease cleavage, and 
glycosylation of the peptide. The term "control sequences" is intended to include, at a minimum, 
components whose presence can influence expression, and can also include additional components 
whose presence is advantageous, for example, leader sequences and fiision partner sequences. A 
"coding sequence" is a polynucleotide sequence which is transcribed and translated into a 
polypeptide. Two coding polynucleotides are "operably linked" if the linkage results in a 
continuously translatable sequence without alteration or interruption of the triplet reading frame. 
A polynucleotide is operably linked to a gene expression element if the linkage results in the 
proper function of that gene expression element to result in expression of the Mtb vaccine 
candidate coding sequence. "Transformation" is the insertion of an exogenous polynucleotide 
{i.e., a "transgene") into a host cell. The exogenous polynucleotide is integrated within the host 
genome. A polynucleotide is "capable of expressing" a Mtb vacdne candidate peptide if it 
contains nucleotide sequences which contain transcriptional and translational regulatory 
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information and such sequences are "operably linked" to polynucleotide which encode the Mtb 
vaccine candidate peptide. A polynucleotide that encodes a peptide coding region can be then 
amplified, for example, by preparation in a bacterial vector, according to conventional methods, 
for example, described in the standard work Sambrook et aL, Molecular Cloning. A Laboratory 
Manual (Cold Spring Harbor Press 1989). Expression vehicles include plasmids or other vectors. 
Prokaryotic vectors known in the art include plasmids such as those capable of replication in jE. 
coli (such as, for example, pBE322, CoIEl, pSClOl, pACYC184, BVX). 

The polynucleotide encoding the Mtb vaccine candidate peptide can be prepared by 
chemical synthesis methods or by recombinant techniques. The polypeptides can be prepared 
conventionally by chemical synthesis techniques, such as described by Merrifield, J. Amer. Chem. 
Soc. 85:2149 (1963). See also, Stemmer etaL, Gene 164:49 (1995). Synthetic genes, the in vitro 
or in vivo transcription and translation of which will result in the production of the protein can be 
constructed by techniques well known in the art {see Brown et al.. Methods in Enzymology 
68:109 (1979)). The coding polynucleotide can be generated using conventional DNA 
synthesizing apparatus such as the Applied Biosystems Model 380A or 380B DNA synthesizers 
(commercially available from Applied Biosystems, Inc., 850 Lincoln Center Drive, Foster City, 
Calif 94404). 

Alternatively, systems for cloning and expressing Mtb vaccine candidate peptides include 
various microorganisms and cells which are well known in recombinant technology. These 
include, for example, various strains of E. coli. Bacillus, Streptomyces, and Saccharomyces, as 
well as mammalian, yeast and insect cells. Suitable vectors are known and available from private 
and public laboratories and depositories and from commercial vendors. See^ Sambrook etal., 
Molecular Cloning, A Laboratory Manual (Cold Spring Harbor Press 1989). See also PCT 
International patent application WO 94/01 139). These vectors permit infection of patient's cells 
and expression of the synthetic gene sequence in vivo or expression of it as a peptide or fiision 
protein in vitro. 

Polynucleotide gene expression elements useflil for the expression of cDNA encoding 
peptides include, but are not limited to (a) viral transcription promoters and their enhancer 
elements, such as the S V40 early promoter, Rous sarcoma virus LTR, and Moloney murine 
leukemia virus LTR; (b) splice regions and polyadenylation sites such as those derived from the 
SV40 late region; and (c) polyadenylation sites such as in SV40. Recipient cells capable of 
expressing the Mtb vaccine candidate gene product are then transfected. The transfected recipient 



wo 01/70774 



PCT/USOl/08906 



cells are cultured under conditions that permit expression of the Mtb vaccine candidate gene 
products, which are recovered from the culture. Host mammalian cells, such as Chinese Hamster 
ovary cells (CHO) or COS-1 cells, can be used. These hosts can be used in connection with 
poxvirus vectors, such as vaccinia or swinepox. Suitable non-pathogenic viruses, which can be 
engineered to carry the synthetic gene into the cells of the host include poxviruses, such as 
vaccinia, adenovirus, retroviruses and the like. A number of such non-pathogenic viruses are 
commonly used for human gene therapy, and as carrier for other vaccine agents, and are known 
and selectable by one of skill in the art. The selection of other suitable host cells and methods for 
transformation, culture, amplification, screening and product production and purification can be 
performed by one of skill in the art by reference to known techniques (see, e.g., Gething & 
Sambrook, Nature 293:620 (1981)). Another preferred system includes the baculovirus 
expression system aiid vectors. 

The polynucleotide encoding the Mtb vaccine candidate peptide can be used in a variety of 
ways. For example, a polynucleotide can express the Mtb vaccine candidate peptide m vitro in a 
host cell culture. The expressed Mtb vaccine candidate peptide immunogens, after suitable 
purification, can then be incorporated into a pharmaceutical reagent or vaccine. 

Alternatively, the polynucleotide encoding the Mtb vaccine candidate peptide immunogen 
can be administered directly into a human as so-called "naked DNA" to express the peptide 
immunogen in vivo in a patient, (see, Cohen, Science 259:1691 (1993), Fynan etaL, Proc. Natl. 
Acad. Sci. USA, 90:11478 (1993); mdWdiSetaL, BioTechniques 11:474 (1991)). The 
polynucleotide encoding the Mtb vaccine candidate peptide immunogen can be used for direct 
injection into the host. This results in expression of the Mtb vaccine candidate peptide by host 
cells and subsequent presentation to the immime system to induce anti-Mtb antibody formation in 
vivo. 

Determinations of the sequences for the polynucleotide coding region that codes for the 
Mtb vaccine candidate peptides described herein can be performed using commercially available 
computer programs, such as DNA Strider and Wisconsin GCQ. Owing to the natural degeneracy 
of the genetic code, the skilled artisan will recognize that a sizable yet defmite number of DNA 
sequences can be constructed which encode the claimed peptides (see, Watson et aL, Molecular 
Biology of the Gene, 436-437 (the Benjamin/Cummings Publishing Co. 1987)). 

Treatment of Mtb infection The method for reducing the levels of Mtb involves exposing 
a human to a Mtb vaccine candidate peptides, actively inducing antibodies that react with Mtb, 
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and impairing the multiplication of Mtb in vivo. This method is appropriate for anMtb infected 
subject with a competent immune system, or an uninfected or recently infected subject. The 
method induces antibodies, which react with Mtb, which reduces multiplication during any initial 
acute infection with Mtb. 

The terms 'treating," ''treatment," and the like are used herein to mean obtaining a desired 
pharmacologic or physiologic eflFect. The effect can be prophylactic in terms of completely or 
partially preventing a disorder or sign or symptom thereof, or can be therapeutic in terms of a 
partial or complete cure for a disorder and/or adverse effect attributable to the disorder. 
"Treating" as used herein covers any treatment and includes: (a) preventing a disorder from 
occurring in a subject that can be predisposed to a disorder, but has not yet been diagnosed as 
having it; (b) inhibiting the disorder, /.e., arresting its development; or (c) relieving or ameliorating 
the disorder. An "effective amount" or 'therapeutically effective amount" is the amount suflBcient 
to obtain the desired physiological effect. An effective amount of the Mtb vaccine candidate 
peptide or vector expressing Mtb vaccine candidate peptides is generally determined by the 
physician in each case on the basis of factors normally considered by one skilled in the art to 
determine appropriate dosages, including the age, sex, and weight of the subject to be treated, the 
condition being treated, and the severity of the medical condition being treated. Among such 
patients suitable for treatment with this method are Mtb infected patients. 

Method of administration. Mtb vaccine candidate peptides can be administered in a variety 
of ways, orally, topically, parenterally e.g. subcutaneously, intraperitoneally, by viral infection, 
intravascularly, etc. Depending upon the maimer of introduction, the Mtb vaccine candidate 
peptides can be formulated in a variety of ways. The concentration of Mtb vaccine candidate 
peptides in the formulation can vary from about 0. 1-100 wt.%. 

The amount of the Mtb vaccine candidate peptide or polynucleotides of the invention 
present in each vaccine dose is selected with regard to consideration of the patient's age, weight, 
sex, general physical condition and the like. The amount of Mtb vaccine candidate peptide 
required to induce an immune response, preferably a protective response, or produce an 
exogenous effect in the patient without significant adverse side effects varies depending upon the 
pharmaceutical composition employed and the optional presence of an adjuvant. Generally, for the 
compositions containing Mtb vaccine candidate peptide, each dose will comprise between about 
50 lag to about 1 mg of the Mtb vaccine candidate peptide immunogens/ml of a sterile solution. A 
more preferred dosage can be about 200 ^lg of Mtb vaccine candidate peptide immunogen. Other 
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dosage ranges can also be contemplated by one of skill in the art. Initial doses can be optionally 
followed by repeated boosts, where desirable. The method can involve chronically administering 
the Mtb vaccine candidate peptide composition. For therapeutic use or prophylactic use^ repeated 
dosages of the immunizing compositions can be desirable, such as a yearly booster or a booster at 
other intervals. The dosage administered will, of course, vary depending upon known factors such 
as the pharmacodynamic characteristics of the particular agent, and its mode and route of 
administration; age, health, and weight of the recipimt; nature and extent of symptoms, kind of 
concurrent treatment, frequency of treatment, and the effect desired. Usually a daily dosage of 
active ingredient can be about 0.01 to 100 mg/kg of body weight. Ordinarily 1.0 to 5, and 
preferably 1 to 10 mg/kg/day given in divided doses 1 to 6 times a day or in sustained release 
form is effective to obtain desired results. 

The Mtb vaccine candidate peptide can be employed in chronic treatments for subjects at 
risk of acute infection. A dosage frequency for such "acute" infections may range from daily 
dosages to once or twice a week intravenously or intramuscularly, for a duration of about 6 
weeks. The peptides can also be employed in chronic treatments for infected patients. In infected 
patients, the frequency of chronic administration can range from daily dosages to once or twice a 
week i.v, or i.m., and may depend upon the half-life of the immunogen (e.g., about 7-21 days). 
However, the duration of chronic treatment for such infected patients is anticipated to be an 
indefinite, but prolonged period. 

For such therapeutic uses, the Mtb vaccine candidate peptide formulations and modes of 
administration are substantially identical to those described specifically above and can be 
administered concurrently or simultaneously with other conventional therapeutics. 

Immunologically acceptable carrier. Mtb vaccine candidate peptides can be administered 
either as individual therapeutic agents or in combination with other therapeutic agents. Mtb 
vaccine candidate peptides can be administered alone, but are generally administered with a 
pharmaceutical carrier selected on the basis of the chosen route of administration and standard 
pharmaceutical practice. The vaccine can further comprise suitable, i.e., physiologically 
acceptable, carriers— preferably for the preparation of injection solutions—and further additives as 
usually applied in the art (stabilizers, preservatives, etc.), as well as additional drugs. The patients 
can be administered a dose of approximately 1 to 10 ^ig/kg body weight, preferably by 
intravenous injection once a day. For less threatening cases or long-lasting therapies the dose can 
be lowered to 0.5 to 5 ng/kg body weight per day. The treatment can be repeated in periodic 
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intervals, e.g:, two to three times per day, or in daily or weekly intervals, depending on the status 
of Mtb infection or the estimated threat of an individual of getting Mtb infection. 

For parenteral administration, peptides of the invention can be formulated as a solution, 
suspension, emulsion or lyophilized powder in association with a pharmaceutically acceptable 
parenteral vehicle. Examples of such vehicles are water, saline, Ringefs solution, deictrose 
solution, and 5% human serum albumin. Liposomes and nonaqueous vehicles such as fixed oils 
can also be used. The vehicle or lyophilized powder can contain additives that maintain isotonicity 
(e.g., sodium chloride, mannitol) and chemical stability (e.g;, buffers and preservatives). The 
formulation is sterilized by commonly used techniques. Suitable pharmaceutical carriers are 
described in the most recent edition of Remington's Pharmaceutical Sciences, a standard 
reference text in this field of art. For example, a parenteral composition suitable for administration 
by injection is prepared by dissolving 1.5% by weight of active ingredient in 0.9% sodium chloride 
solution. The preparation of these pharmaceutically acceptable compositions, having appropriate 
pH isotonicity, stability and other conventional characteristics is within the skill of the sirt. 

The vaccine composition can include as the active agents, one of the following above- 
described components: (a) an Mtb vaccine candidate peptide immunogen, which can be in the 
form of recombinant proteins or, alternatively, can be in the form of a mixture of carrier protein 
conjugates; (b) a polynucleotide encoding a Mtb vaccine candidate; (c) a recombinant virus 
carrying the synthetic gene or molecule; and (d) a bacteria carrying the Mtb vaccine candidate. 
The selected active component is present in a pharmaceutically acceptable carrier, and the 
composition can also contain iadditional ingredients. 

Formulations containing the Mtb vaccine candidate peptide can contain other active 
agents, such as adjuvants and immunostimulatory c)rtokines, such as IL-i2 and other well-knoAm 
cytokines, for the peptide compositions. 

Suitable pharmaceutically acceptable carriers for use in an immunogenic composition are 
well known to those of skill in the art. Such carriers include, for example, saline, a selected 
adjuvant, such as aqueous suspensions of aluminum and magnesium hydroxides, liposomes, oil in 
water emulsions, and others. 

Carrier protein. Mtb vaccine candidate peptide immunogens can be linked to a suitable 
carrier in order to improve the efiBcacy of antigen presentation to the immune system. Such 
carriers can be, for instance, organic polymers. A carrier protein can enhance the immunogenicity 
of the peptide immunogen. Such a carrier can be a larger molecule, which has an adjuvant effect. 
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Exemplary conventional protein carriers include, keyhole limpet hemocyan, E, coli DnaK protein, 
galactokinase (galK, which catalyzes the first step of galactose metabolism in bacteria), ubiquitin, 
a-mating factor, P-galactosidase, and influenza NS-1 protein. Toxoids (/.e., the sequence which 
encodes the naturally occurring toxin, with suflScient modifications to eliminate its toxic activity) 
such as diphtheria toxoid and tetanus toxoid can also be employed as carriers. Similarly a variety 
of bacterial heat shock proteins, e.g,, mycobacterial hsp-70 can be used. Glutathione reductase 
(GST) is another useful carrier. One of skill in the art can readily select an appropriate carrier. 

Viruses can be modified by recombinant DNA technology such as, e.g. rhinovirus, 
poliovirus, vaccinia, or influenzavirus, etc. The peptide can be linked to a modified, r.e., 
attenuated or recombinant virus such as modified influenza virus or modified hepatitis B virus or 
to parts of a virus, e,g,, to a viral glycoprotein such as, e.g., hemagglutinin of influenza virus or 
surface antigen of hepatitis B virus, in order to increase the immunological response against Mtb- 
infected cells. 

The Mtb vaccine candidate peptides can be in fiasion proteins, wherein they are linked to a 
suitable carrier which might be a recombinant or attenuated virus or a part of a virus such as, e.g. , 
the hemagglutinin of mfluenza virus or the surface antigen of hepatitis B virus, or another suitable 
carrier including other viral surface proteins^ e.g, surface proteins of rhinovirus, poliovirus, 
sindbis virus, coxsackievirus, etc., for eflScient presentation of the antigenic site(s) to the immune 
system. In some cases, the antigenic firagments might, however, also be purely, i.e., without 
attachment to a carrier, applied in an analytical or therapeutical program. 

Naked DNA vaccine. Alternatively, polynucleotides can be designed for direct 
administration as "naked DNA". Suitable vehicles for direct DNA, plasmid polynucleotide, or 
recombinant vector administration include, without limitation, saline, or sucrose, protamine, 
polybrene, polylysine, polycations, proteins, calcium phosphate, or spermidine. See e.g^ PCT 
International patent application WO 94/01 139. As with the immunogenic compositions, the 
amounts of components in the DNA and vector compositions and the mode of administration, 
e.g^ injection or intranasal, can be selected and adjusted by one of skill in the art. Generally, each 
dose will comprise between about 50 fig to about 1 mg of immunogen-encoding DNA per ml of a 
sterile solution. 

For recombinant viruses containing the coding polynucleotide, the doses can range fi-om 
about 20 to about 50 ml of saline solution containing concentrations of from about 1x10^ to 
1x10^° pfu/ml recombinant virus of the invention. One human dosage is about 20 ml saline 
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solution at the above concentrations. However, it is understood that one of skiil in the art can 
alter such dosages depending upon the identity of the recombinant virus and the make-up of the 
inununogen that it is delivering to the host. 

The amoimts of the commensal bacteria carrying the synthetic gene or molecules to be 
delivered to the patient will generally range between about 10^ to about 10^^ cells/kg. These 
dosages, will of course, be altered by one of skill in the art depending upon the bacterium being 
used and the particular composition containing immunogens being delivered by the live bacterium. 

Antibodies. An antibody directed against an Mtb vaccine candidate peptide is also an 
aspect of this invention. Polyclonal antibodies are produced by immuniang a mammal with a 
peptide immimogen. Suitable mammals include primates, such as monkeys; smaller laboratory 
animals, such as rabbits and mice; as well as larger animals, such as horse, sheep, and cows. Such 
antibodies can also be produced in transgenic animals. However, a desirable host for raising 
polyclonal antibodies to a composition of this invention includes humans. The polyclonal 
antibodies raised are isolated and purified jfrom the plasma or serum of the immunized mammal by 
conventional techniques. Conventional harvesting techniques can include plasmapheresis, among 
others. Such polyclonal antibodies can themselves be employed as pharmaceutical compositions of 
this invention! Alternatively, other forms of antibodies can be developed using conventional 
techniques, including monoclonal antibodies, chimeric antibodies, humanized antibodies and fijUy 
human antibodies See, e.g.. United States patent 4,376,110; Ausubel etaL, Ctirrent Protocols in 
Molecular Biology (Greene Publishing Assoc. and Wiley Interscience, N.Y., 1992); Harlow & 
Lane, Antibodies: a Laboratory Manual, (Cold Spring Harbor Laboratory, 1988); Queen^^ al, 
Proc. Natl. Acad. Sci. USA 86:10029 (1989); Hodgson et aL, Bio/Technology 9:421 (1991); 
PCT Intemational patent application WO 92/04381 and PCT International patent application WO 
93/20210. Other antibodies can be developed by screening hybridomas or combinatorial libraries, 
or antibody phage displays (Huse etal, Science 246: 1275 (19S8)) using the polyclonal or 
monoclonal antibodies produced according to this invention and the amino acid sequences of the 
primary or optional immunogens. 

The term "antibody" includes polyclonal antibodies, monoclonal antibodies (mAbs), 
chimeric antibodies, anti-idiotypic (anti-Id) antibodies to antibodies that can be labeled in soluble 
or bound form, as well as fragments, regions or derivatives thereof, provided by any known 
technique, such as, but not limited to en2ymatic cleavage, peptide synthesis or recombinant 
techniques. An "antigen binding region" is that portion of an antibody molecule which contains 
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the amino acid residues that interact with an antigen and confer on the antibody its specificity and 
aflOnity for the antigen. The antibody region includes the fi-amework amino acid residues necessary 
to mmntain the proper conformation of the antigen-binding residues. 

Computer Implementation. Aspects of the invention may be implemented in hardware or 
software, or a combination of both. However, preferably, the algorithms and processes of the 
invention are implemented in one or more computer programs executing on programmable 
computers each comprising at least one processor, at least one data storage system (including 
volatile and non-volatile memory and/or storage elements), at least one input device, and at least 
one output device. Program code is applied to input data to perform the functiotis described 
herein and generate output information. The output information is applied to one or more output 
devices, in known fashion. 

Each program may be implemented in any desired computer language (including machine, 
assembly, high level procedural, or object oriented programming languages) to communicate with 
a computer system. In any case, the language may be a compiled or interpreted language. 

Each such computer program is preferably stored on a storage media or device {e.g., 
ROM, CD-ROM, tape, or magnetic diskette) readable by a general or special purpose 
programmable computer, for configuring and operating the computer when the storage media or 
device is read by the computer to perform the procedures described herein. The inventive system 
may also be considered to be implemented as a computer-readable storage medium, configured 
with a computer program, where the storage medium so configured causes a computer to operate 
in a specific and predefined manner to perform the fijnctions described herein. 

The following EXAMPLES are presented in order to more fully illustrate the preferred 
embodiments of the invention. These examples should in no way be construed as limiting the 
scope of the invention, as defined by the appended claims. 

EXAMPLE 1 

PREDICTING T-CELL EPITOPES WTTHIN Mtb PROTEIN ANTIGENS 
Mtb Antigens. The EpiMer algorithm was applied for Mtb protein antigens, all of which 
were selected for analysis because they had been previously shown to stimulate proliferative 
responses in Mtb infected subjects. The purpose of this study was to prospectively confirm the 
utility of the EpiMer algorithm, by (1) measuring the response of Mtb immune subjects to 
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EpiMer-predicted peptides containing clusters of MHC-binding motifs, and by (2) measuring 
individual responses to other peptides containing motifs matched to the subjects' HLA-DR allele. 

Four MW protein antigens were studied: 14 kDa, 16 kDa, 19 kDa, and 32 kDa. The 14 
kDa protein, also known as MTP40, is unique to Mtb (See Parra, et aL, Infect and Imniun 
59:341 1 (1991); Falla, et al. Infect and Immun 59:2285 (1991)). Falla et al have identified both 
B and T cell epitopes within this protein. The 16 kDa protein, the major protein a;ssociated with 
membrane preparations of Mtb, has approximately 30% homology with the alpha-crystallin family 
of low molecular weight heat shock proteins {See Lee, etah^ Infect and Immun 60:2285 (1991); 
Verbon, etal.^ JBacteriol 174:1352 (1992)), Vordermeier, etal, have identified both murine and 
human T cell epitopes within this protein using the overlapping peptide method {See Vordermeier, 
etal.^ Immunology 79:8 (1993); Lamb, ei cd., Eur J Immunol 18:973 (1988)). The 19 kDa 
antigen has been shown to contain both human and murine T cell epitopes in a number of studies 
See Lamb, et al, Eur J Immunol 18:973 (1988); Ashbridge, etal, J Immunol 148:2248 (1992); 
Faith, et al.. Immunology 74: 1 (1991); Rees, et al. Immunology 80:407 (1993); Harris, et al, J 
Immunol 150:407 (1993). The 32 kDa protein, also known as Antigen 85A is one of a number of 
secreted proteins referred to as the Antigen 85 complex. See Wiker, et al, Microbio Rev 56:648 
(1992). 

Secreted antigens appear to be a primary target of the protective immune response to Mtb 
because they are believed to be more readily available to macrophages for antigen processing and 
peptide presentation, leading to a strong T cell response. See Boesen, et al. Infect and Immun 
63 : 1491 (1995). Studies by Huygen et al and Launois et al have described both murine and 
human T cell epitopes within the 32 kDa antigen {See Huygen, et al. Infect and Immun 82:363 
(1994); Launois, etal. Infect and Immun 62:3879 (1994)). 

Epitope predictions. Amino acid sequences for the four proteins studied were obtained 
fi-om the Protein Identification Resource (National Library of Medicine) on-line database; these 
were A43589 (14 kDa Mtb antigen); A43823 (16 kDa Mtb antigen); 802753 (19 kDa Mtb 
antigen); and A3 7024 (32 kDa antigen). The EpiMer algorithm uses MHC-binding motifs to 
generate motif matches from the amino acid sequence of a protein. By stepping a reading frame of 
length r (set to 1 1 for these experiments) one amino acid at a time through the protein primary 
structure, the algorithm deternunes the motif density J for each peptide of length r within the 
protein. Using a minimum density value dmin^ set to the sum of the protein's mean MHC-binding 
motif density d plus one standard deviation, EpiMer extracts only those motif-dense 'clusters' 
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with cEcbnin, Finafly, the algorithm uses a 'threading value' r, of 10, to link selected clusters of 
contiguous segments into single peptides, depending on their distance apart in the amino acid 
sequence. (As an example, ^ = 10 would assure that motif-rich clusters from one to ten amino 
acids apart would be linked into the same predicted peptide, but that clusters of eleven or more 
amino acids apart would not be linked into a single prediction. The technique of threading was 
implemented to avoid the generation of multiple peptides overlapping the same short region of a 
protein. These clusters of MHC-binding motifs constitute the EpiMer algorithm's predictions for 
putative T cell epitopes for a protein. A fiill description of the method has been published 
(Meister, etal,. Vaccine 13:581 (1995)). 

The EpiMer algorithm was executed in Microsoft Excel v4.0 and v5.0 (Microsoft 
Corporation, Redmond, WA) using a Macintosh Quadra 650 and a PowerMacintosh 7100 (Apple 
Computer, Inc., Cupertino, CA). Two versions of EpiMer were used in these experiments: 
EpiMer ML 1994 and EpiMer ML 0595, The EpiMer ML 1994 motif database contains a total of 
15 distinct class II MHC-binding motifs, as described previously (Meister^ etaL^ Vaccine 13:581 
(1995)). This version of EpiMer was used in 1994 to predict the peptides for the experiments. 
Over the course of time, more MHC-binding motifs were published; EpiMer ML 0595 replaced 
the earlier version of EpiMer in May, 1995 (Roberts, et al, AIDS Res Hu Retrovir 7:593 (1996)). 
EpiMer ML 0595 employs a more extensive motif list and modifications of the original motifs, 
described in Roberts, et al . 

Peptide synthesis. A total of 23 putative epitopes were predicted by the EpiMer ML 1994 
algorithm from the four Mtb protein antigens studied. In those cases where EpiMer-predicted 
sequences were greater than 20 amino acids in length, overlapping peptides were identified that 
spanned the given EpiMer prediction. This was necessary due to difficulties encountered in the 
synthesis of peptides greater than 20 amino acids in length. Four cases arose in which the 
synthesis of overlapping peptides was necessary. In three of these cases a 20 amino acid peptide 
was generated from the first 20 amino acids of the predicted epitope's sequence, and another with 
the sequence of the last 20 amino acids of the same prediction (see peptides 14-2,14-3 [overlap of 
9 amino acids]; 14-5,14-6 [overlap of 14 amino acids]; and 16-1,16-2 [overlap of 14 amino 
acids]). In one case, an EpiMer-prediction included 42 amino acids; three overlapping 20 amino 
acid peptides were synthesized to span the length of this predicted putative epitope (peptides 32- 
5, 32-6, 32-7). A total of 28 peptides were synthesized to correspond to the 23 putative epitopes 
predicted by EpiMer at the Torrey Pines Institute (San Diego, CA), using the simultaneous 
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multiple peptide synthesis method, employing t-butoxycarbonyl amino acids and hydrogen 
fluoride cleavage. The purity of these synthetic peptides was determined by high-pressure liquid 
chromatography and mass spectroscopy. 

EXAMPLE2 

MEASURING RESPONSE OF Mtb IMMUNE SUBJECTS TO EpiMer-PREDICTED 

PEPTIDES 

Subjects. Forty purified protein derivative (PPD) skin-test positive subjects (designated 
Mtb immune for this study) were recruited firom the Roger Williams Hospital Tuberculosis Clinic, 
Providence, RI; the Lemuel Shattuck Hospital, Jamaica Plain, MA; and the Research Institute for 
Tropical Medicine, Manila, the Philippines. AH of these subjects had skin induration greater than 
10 mm at 48 hours post inoculation with 5 U.S. Units (TU) PPD (Connaught Laboratories, 
Ontario, Canada), and all were healthy adults (ages ranged from 1 8 to 66 years) with no 
radiologic or clitiical sign of active TB. 

Nineteen PPD skin-test negative subjects were recruited to serve as controls (Mtb naive 
group). For this group of subjects, the ages ranged fi-om 21 to 46 years. None of these subjects 
had been immunized with the BCG vaccine. None of the subjects in either group gave a history of 
TB exposure, HIV infection, or of taking immunosuppressive medications. IRB approval was 
obtained from the Roger Williams Hospital (TB Clinic); Lemuel Shattuck Hospital; Brown 
University; and the Research Institute for Tropical Medicme, Manila, the PhiUppines, and all study 
subjects gave informed consent to participate in this study. 

Cell isolation. Whole blood was collected in heparinized Venoject tubes (Curtin- 
Matheson Scientific. Wilmington, MA). Peripheral blood mononuclear cells (PBMG) were 
isolated by centrifugation over Histopaque 1.077 (Sigma Chemical Corp., St. Louis, MO). Cells 
were suspended m RPMI 1640 medium (JRH Biosciences, Lenexa, KS) supplemented with 
2 mM L-glutamine (Sigma), 5 |ig/ml of cefazolin (Sigma), and 10% heat-inactivated human AB 
serum (Sigma). 

Lymphoproliferation assay. Assays were performed in 96-well round bottom plates 
(Corning-Costar Corp., Cambridge, MA). Triplicate cultures containing 2 x 10^ PBMC in 0,2 ml 
culture medium, with or without peptide, were incubated for 4 days at 37 °C in a 5% CO2- 
enriched, humidified atmosphere. After this 4 day period, 1 p.Ci of [^HJthymidine (ICN 
Biomedicals, Costa Mesa, CA) was added to each well for an additional 14 hours of incubation. 



-17- 



wo 01/70774 



PCT/USOl/08906 



Responses to several control antigens including PPD (5 pg/ml) (Connaught Laboratories, Inc., 
Swiftwater, PA); tetanus toxoid (TT) (5 jig/ml) (Connaught); phytohemaglutinin (PHA), a 
nonspecific T cell mitogen, (1 ng/ml) (Sigma); and to each of the 28 EpiMer-predicted peptides 
(1 =g/ml and 10 =g/ml) were measured. After incubation with [^HJthymidine, the 96-well plates 
were harvested onto fiberglass filtermats using a 96-well semiautomated Tomtec Mach IIM 
harvester (Tomtec, Groton, CT). Counts per minute (cpm) were measured in aBetaplate 1205 
scintillation counter (Wallac, Turku, Finland). Stimulation indices (SI) were calculated as follows: 

SI = mean (cpm in wells containing peptide or antigen) 

mean (cpm in wells containing medium and cells alone) 

Responses to peptides were graded as positive if SI > 2.0. 

Statistical analysis. All statistical analyses were performed in Microsoft Excel v4.0 or 
v5.0 (Microsoft Corporation, Redmond, WA) using a Macintosh Quadra 650 or a 
PowerMacintosh 7100 (Apple Computer, Inc., Cupertino, CA). The relative risk (RR) of 
response and the 95 % confidence intervals (CI) for this ratio were calculated using an Excbl v4.0 
spreadsheet. 

Results. 

Epitope predictions. EpiMer-predicted epitopes are shown in Tables 1 and 2. Table 3 Ksts 
26 distinct MHC-binding motifs described for the human leukocyte antigen (HLA) class n alleles 
and included in the EpiMer ML 0595 motif list at the time these experiments were initiated. In 
some cases, multiple, distinct MHC-binding motifs have been published for the same HLA allele. 
In both versions of EpiMer used here, each match to a motif was counted separately and equally. 
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Table 2 



Peptide 


Start 


Stop 


Sequence 


# aa*s 


Alleles Represented (Putative) 


tfMbtif 
s(ML 
0595)^ 


16-4* 


74 


84 


LTDCAERTEQK (SEQ ID 
NO:10) 


11 


- 


0 


32-3* 


67 


82 


S WMPVGGQSSFYSDW (SEQ 
IDNO:20^ 


16 


- 


0 


32-9 


227 


242 


KFLEGFVRTSNIBCFQD (SEQ 


16 


- 


0 


14-4 


56 


68 


QAVMLGTOTPNRA (SEQ ID 
NO:4) 


13 


DRl(c) 




16-3 


54 


66 


AELPGVDPDKDVD (SEQ ID 
NO:9) 


13 


DRB5*0101(a) 




19-4 


107 


117 


VTLGYTSGTGQ (SEQ ID 
NO:16) 


11 


DRBl *0404(DR4Dwl4) 




32-10 


252 


264 


GVFDFPDSGTHSW (SEQ ID 

NO:27) 


13 


DRl(c) 


1 


32-2 


35 


47 


ALYLLDGLRAQDD (SEQ ID 
]SIO:19) 


13 


HLA-DRB1*0101(R) 




16-6* 


119 


135 


GJLTVS VAVSEGKPTEK (SEQ 
ID NO: 12) 


17 


DQ3.Z,DQ7, 
DRB 1 *040 ld)R4Dw4)(a) 


3 


9-5 


127 


139 


SHYKTTGTATGVD (SEQ ID 
NO:17) 


13 


DP9,DPw4(bXDQ3.1 


3 


32-11 


265 


276 


EYWGAQLNAMKP (SEQ ID 
NO:28^ 


12 


DPA1*0102/DPB1 *0201 
D03.1,DRl(c) 


3 . 


32-4 


99 


110 


TFLTSELPGWLQ (SEQ ID 
NO:21) 


12 


HLA-DRird DRR 1*0^01 


3 


14-6* 


80 


99 


VSETISGPRLYGEMTMQGTR 
(SEQIDNO:6) 


20 


DQ7, 

DRB 1 *0404(DR4Dwl4), 
DRB1*0801 


4 


32-1 


6 


20 


LPVEYLQVPSPSMGR (SEQ 
IDNO:18) 


15 


DPAl *0102/DPB 1 *020 1 , 
DRl (c), DRB 1*0101 (R), 
DRB1*0701 


4 


14-2 


19 


38 


GSFGSAPSNGWLKLGLVEFG 
(SEQIDNO:2) 


20 


DQ3. 1 , DRl(c), DRB 1 *0701 


5 


14-5* 


74 


93 


CEVWSNVSEHSGPRLYGEM 
(SEQIDNO:5) 


20 


DQ3.1,DRl(c), 
DRB1*0101(R), DRB1*0301, 
DRB1*0701, DRB1*0801 




16-1 


11 


30 


RSLFPEFSELFAAFPSPAGL 
(SEQIDNa?) 


20 


DRB1*0701 DRBl*0101/'R'i 

Jm^M\ 4, V C V Jl g X^J-\J^ X V X V X «i #9 

DRl(c), DPw4(b) 


5 


16-2* 


17 


36 


FSELFAAKPSFAGLRFIKDT 
(SEQIDNO:8) 


20 


DPw4(b), DQ3.1, DRl(c), 
DRB1*0101(R) 


6 


19-2* 


58 


74 


QNVTGSWCTTAAGNVN 
(SEQIDNO:14) 


17 


DP\v4(b), HLA-DQ3.1, HLA- 
DRl(c) 


6 


32-7* 


135 


154 


LAIYHPQQFVYAGAMSGLLD 
(SEQ 3D NO:24) 


20 


DPw4(b), DQ3.1.DRl(c), 


7 


14-1* 


1 


18 


VDLGNAPSWPNTILGMHC 
(SEQ ID NO:l) 


18 


DQ3.1,DRl(c), 
DRB 1 *0 1 0 1 (RX DRB 1 *0301 , 
DRB 1*0701 


8 


16-5* 


92 


107 


FAYGSFVRTVSLPYGA (SEQ 
IDNO:ll) 


16 


DPw4(b), DQ3.1,DQ7, DRl(c), 
DRBl *040 1 (DR4Dw4)(a), 
DRB 1*0701 


8 


32-8* 


193 


210 


PIXNVGKLIANNTRVWVY 
(SEQIDNO:25) 


18 


DQ3.1,DRl(c), 
DRBl *0404(DR4Dwl4X 
DRBl*070l, DRB1*0801, 
DRBl*1501,DPw4(b) 


8 


14-3* 


30 


49 


LKLGLVEFGGVAKLNAEVMS 
(SEQIDNO:3) 


20 


DPw4(b), DQ3.1, HLA-DRl(c), 
DRB1*0101(R), DRB1*110U 


9 
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DRB1*1201,DKB1*1501 




32-5* 


113 


132 


RHVKPTGSAWGLSlwIAASSA 
(SEQIDNO:22) 


20 


T>Q3A,DRl(cX 
DRB 1 *040 l(DR4Dw4)(a), 
DRBl*0404(DR4dDwl4), 
DRBl*0701,DPw4(b) 


9 


19-3* 


75 


94 


lAIGGAATGIAAVLTDGKPP 


20 


DPw4(bX DQ3.1, DRl(c), 


12 


19-1* 


7 


25 


VAVAOAAILVAGLSGCSSN 
(SEQIDNO:I3) 


19 


DPw4(b), DQ3 . 1 , DRl (c), 

DRB1*0101(R), 
DRBl *0401(DR4Dw4Xa) 


16 


32-6* 


23 


142 


VGLSMAASSALILAIYHPQQ 
CSEQIDNO:23) 


20 


DPw4(bX DQ3.1,DRl(cX 

DRB1*0101(R), 
DRBl*0401(DR4Dw4)(a). 
DRBl*0404(DR4Dwl4) 


16 



V = number of motifs is detennined by counting the absolute number of MHC binding motifs, including reiterated motifs 
* = 1 5 BpiMer peptides that coirelate with published epitopes 



Table 3 



Class II MHC Binding Motrfe utilized in Epimer Mofif 
List 0595 


















Motif 
Name 


ReferenceCs) 


Position in peptide 


1+0 


1+1 


1+2 


1+3 


1+4 


1+5 


1+6 


1+7 


1+8 


1+9 


1+10 


DP9 


Dong et al., J 

Immunol 
154:4536-4545 
(1995) 


RK 










AGl 






LV 






HLA- 
DPAl* 
0102/ 
DPBl* 

0201 


Raiimieiisec et al. 
ImmutiogeneticS 
41.178-228 (1995) 


FLM 

VW 
T 








FL 
MY 






lAMV 








HLA- 
DPw4 
(b) 


Falk et al., 
Immunogenetics 
39: 230-242 
(1994) 


FLY 
MIV 

A 












FLY 
MVIA 






VYIA 
L 




HLA- 
D02 


Verrecketal,, EJI 
24:375-379 (1994) 


K 






I 










F 






HLA- 
DQ3.1 


Sidney et al., J. 

Immunol. 
152:4518-4525 
(1994) 


No 
UK 
DEP 


No 
RK 
DE 


AG 
ST 


No 
DE 


AV 
LI 














HLA- 
DQ7 


Falketal., 
Immunogenetics 
39:230-42(1994) 


FYI 
ML 
Y 








VLI 
MY 




YFM 
LVl 










HLA- 
DRl 

(a) 


Hammer et al., X 
Exp. Med. 176: 
1007-1013 (1992) 


YF 
W 


No 
DE 


No 
DE 


ML 


No 
DE 


GA 


No 
DE 


No 
DE 


LMAI 
GTVQ 
S 






HLA- 
DRl 
(P) 


Kropshofer et aL, J. 
Exp. Med. 
175:1799-1803 
(1992) 


AVI 
LYF 
WM 
C 










STA 
VIL 
PQ 






AVILY 
FWMC 






HLA- 
DRl 
(d) 


Hammer et al, 
PNAS USA 91 
(10); 4456-4460 
(1994) 


YW 
FIL 
VM 


R 




ML 


GA 






L 








HLA- 
DR3 
(b) 


Sidney et al., J. 

Immunol. 
149:2634-2640 
(1992) 


AVI 
LYF 
WM 
C 




AV 

IL 

YF 

W 

MC 


QN 
RK 
DES 
T 




RK 
H 












HLA- 

DRBl 
*0101 


Rammaxsee et aJ<, 
Immunogenetics 
41:178-228(1995) 


YVL 

FIA 

MW 






LAI 
VM 
NQ 




AGS 
TP 






LAIVN 
FYM 
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HLA- 
DRBl 
*0301 


Corrected. Chicz et 
al., J. Exp. Med. 
178:27-47(1993) 


YF 

WU 

VM 






DEN 










YMLI 






HLA- 
DRBl 
*0301 
(a) 


Malcherek et al., 
Intl. ImmiuioL 
5:1229-1237 
(1994); stretch 
variant of 
Ranunensee et al 
(1995) 


LIF 
MV 






D 




KRE 
QN 




LYF 








HLA- 
DRBl 
*0301 
(b) 


Malcherek etal., 
Intl. Immunol. 
5:1229-1237 
(1994); Also 
Ranjmensee et al. 
(1995) 


LIF 
MV 






D 




KRE 
QN 






LYF 






HLA- 
DRBl 
*0301 
(P> 


Malcherek etal., 
Intl. Immunol. 
5:1229-1237 
(1994); stretch 
variant of 
. Raitmiensee et al. 
(1995) 


LIF 

MV 






D 




KRE 
QN 








LYF 




HLA- 
DRBl 
*0401 
(Dr4D 
w4(a)) 


Rammensee et al,, 
Immunogenetics 
41:178-228 (1995) 


FY 
WIL 

VM 
GA 






FWI 
LVA 
DE 
M 




NST 
QH 
RVL 
IM 


DAS 

VHPL 

NMI 




ASQG 
LTVK 






HLA- 
DRBl 
*0402 
(DR4D 

wlO) 


Rammensee et al., 
Immunogenetics 
41:178-228 (1995) 


YIL 
M 






No 
DE 




NQS 
TK 


RKH 
NQP 




AHGQ 
SNLT 
V 






HLA- 
DRBl 
*0404 
(DR4D 
wl4) 


Rammensee et al,, 
Immunogenetics 
41:178-228(1995) 


VIL 
M 






No 
RK ' 




NTS 
QR 


ANV 
QKP 
DMS 
HLIT 




ASQG 
LTVK 






HLA- 
DREl 
*0405 
CDR4D 
wl5) 
(a) 


Rammonsee et al, 
bnmunogenetics 
41:178-228(1995) 


FY 

WVI 

LM 






VIL 
MD 
E 




NTS 

QK 

DV 


ANV 
QKP 
DMS 
HLIT 




DEQ 






HLA- 
DRBl 
*0701 


Corrected. Chicz et 
al., J. Exp. Med. 
178:27-47(1993) 


WY 
FM 
VLI 










TS 






WYFM 
LVI 






HLA- 

DRBl 
*0801 


Corrected Chicz et 
a!., J. Exp. Med. 
178:27-47(1993) 


YF 
MV 
LI 








KR 














HLA- 
DRBl 
*1101 


Rammensee et al,, 
Immunogenetics 
41:178-228(1995) 


WY 
F 






MLI 
V 




RK 












HLA- 
DHBl 
*1201 


Rammensee et al., 
Immunogenetics 
41:178-228 (1995) 


ILF 

YV 




LM 
NV 
A 






VYF 
INA 






YFMIV 






HLA- 

DRBl 
* 1501 


Corrected Chicz et 
al., J. Exp. Med. 
178:27-47(1993) 


LIV 






YF 
WIV 






FLIV 
M 










HLA- 
DRB5 
*0101 


Rammensee et al,, 
Immunogenetics 
41:178-228 (1995) 


FYL 
M 






QVI 
M 










RK 






HLA- 
DRB3 
*0101 
(a) 


Corrected. Chicz 
et al., J. Bxp. Med. 
178:27-47(1993); 
acyusted based on 
Reammensee et al. 
(1995) 


VFL 
M 






YMI 
Q 








KR 









Table 1 lists the 28 peptides (14-1 through 32-1 1) that were synthesized to correspond to 
the 23 predicted epitopes, as described in Example 1, supra, and the number of MHC-binding 
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motifs contained -mthin each peptide. Four regions were predicted tor the 14 kUa protein, 
corresponding to six synthesized peptides (14-1 to 14-6); five regions were predicted for the 16 
kDa protein, corresponding to six synthesized peptides (16-1 to 16-6); five regions were predicted 
for the 19 kDa protein, corresponding to five synthesized peptides (19-1 to 19-5); and nine 
regions were predicted for the 32 kDa protein, corresponding to 11 synthesized peptides (32-1 to 
32-11). 

In Tables 1 and 2, start and stop numbers indicate amino acid positions within the native 
proteins. Amino acids are abbreviated with their single-letter designations. The alleles represented 
by the motifs identified in the peptide (firom ML 0595) are listed under the heading "Allele". 
Under the heading "# of Motifs", the number of binding motifs contained within the given peptide 
is given, based on the motifs shown in Table 3 (ML 0595). Table 3 is sorted by the number of 
motif matches, 0-16. 

Lymphoproliferation assay. Table 4 is a list of the Mtb-immune group and the Mtb-naive 
group. The results of the lymphoproliferation assays for the 28 peptides are listed in FIG. 2. 
Overall, 29 of 40 (72%) Mtb-immime subjects responded to one or more of the 28 peptides 
tested. Eleven (28%) of the Mtb-immune subjects failed to respond to any of the 28 peptides. 



Table 4a, Mtb-immune group 



Subject 


Age 


Sex 


PPD Status 


Countty of 
Origin 


BCG 


HLA 


1 


27 


F 


Unkn date PPD+ 


Dominican 
Republic 


Yes 




2 


61 


F 


Recent convert 


USA 


No 




3 


34 


F 


Unkn date PPD+ 


Azores 


Yes 




4 


66 


F 


Unkn date PPD+ 


USA 


No 




5 


36 


F 


Unkn date PPD+ 


Nigeria 


Yes 




6 


32 


M 


Unkn date PPD+ 


Columbia 


Yes 




7 


26 


M 


Recent convert 


India 


Yes 




S 


28 


M 


Unkn date PPD+ 


Mozambique 


Yes 




9 


34 


F 


Recent convert 


Haiti 


Yes 




10 


33 


F 


Unkn date PPD+ 


USA 


No 




11 


19 


F 


Unkn date PPD+ 


Domincan 

Republic 


Yes 




12 


35 


M 


Unkn date PPD+ 


USA 


No 




13 


38 


F 


PPD+ since 1992 


Dominican 
Republic 


Yes 




14 


35 


F 


Unkn date PPD+ 


USA 


No 




15 


28 


F 


PPD+ since 1992 


Cape Verde 


Unk 
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16 


28 


F 


Unkn date PPD+ 


USA ' 


No 


- 


17 


21 


M 


Unkn date PPD-l- 


Dominican 
Republic 


Unk 


- 


18 


18 


M 


Unkn date PPD+ 


Argentina 


Yes 




19 


33 


M 


Unkn date PPD+ 


Cape Verde 


Yes 




20 


53 


F 


PPD+ since 1970 


Canada 


No 




21 


54 


F 


PPD+ since 1991 


Israel 


Yes 




22 


66 


F 


PPD+ since 1993 


Phillipines 


No 





Table 4b. Mtb-naive group 



Subject 


Age 


Sex 


PPD Status 


Country of 
Origin 


BCG 


HLA 


1 


38 


F 


PPD negative 


USA 


No 




2 


40 


M 


PPD negative 


USA 


No 




3 


22 


F 


PPD negative 


USA 


No 




4 


21 


M 


PPD negative 


USA 


No 




5 


46 


F 


PPD negative 


USA 


No 




6 


28 


M 


PPD negative 


USA 


No 




7 


25 


F . 


PPD negative 


USA 


No 





All but six Mtb-immune subjects and one Mtb-naive control showed a proliferative 
response to TT (SI range 2 - 182). All Mtb-immune subjects showed an in vitro response to PPD, 
although the intensity of these in vitro responses varied greatly (SI range 2.2 - 166; median 20). 
Of the 41 subjects who were tested, all demonstrated a robust response to phytohemagglutinin. 

Of the 28 peptides synthesized, all were found to induce a proliferative response (SI > 2.0) 
in at least one Mtb-immune individual. The number of Mtb-immune responders varied from one 
(for peptide 16-3) to 13 (for peptide 19-1). Twelve peptides (14-2, 14-3, 16-1, 16-5, 19-1, 19-2, 
19-3, 32-3, 32-5, 32-6, 32-7, and 32-9) induced a proliferative response in six of more Mtb- 
immune subjects. 

Individual study subjects responses to these twelve broadly-recognized peptides are shown 
in FIG. 2. In 10 of the 19 (53%) Mtb-naive individuals, at least one of the 25 peptides induced a 
proliferative response (SI > 2.0). 

Certain peptides were identified to which none of the Mtb-naive controls responded, and 
to which a high proportion of the Mtb-immune subjects showed a response (16-5, 28% 
responders and 32-8, 25% responders). Six of the 19 Mtb-naive controls and 14 of the 40 Mtb- 
immune subjects showed at least one response to the Mtb-unique 14 kDa peptide. The number of 
subjects was too small to determine whether there was any relationship between number of 
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responses to peptides and dther BCG vaccination status or race/ethnicity; likewise, no 
relationship could be observed between BCG status or race/ethnicity and response/non-response 
to a particular peptide. 

In order to deternaine whether peptides with multiple motif matches were more likely to 
induce a response in multiple Mtb-immune subjects, the peptides were dichotomized by number of 
motif matches (0 to 4 and > 5) and by number of responders (0 to 5 and > 6), using the final list of 
ML 0595 motifs (Table 3). Peptides with at least five motif matches were more likely to induce a 
response in 8 or more subjects (71%) than peptides with fewer than five motif matches (14%) 
(RR 5.0, 95% CI 1-7 to 14). A regression analysis also demonstrates this relationship (R^2%) 
(FIG. 3). 

EXAMPLES 

MEASURING INDIVIDUAL RESPONSES TO OTHER PEPTIDES CONTAINING 
MOTIFS MATCHED TO SUBJECTS' HLA-DR ALLELE. 

HLA Variation in Populations. The distribution of MHC alleles varies firom population to 
population. In general, the MHC-peptide (epitope) interaction is governed by the sequence of the 
peptide: each MHC has its own constraints, which can be described as a pattern, or motif, 
characterizing the set of peptides that can bind in the binding groove of the MHC. While the 
distribution of MHC in populations inhabiting different regions of the world may restrict, to some 
extent, the relevance of selected epitopes in different human populations, means to surmount this 
difficulty have been proposed. For example, identification of epitopes that may be recognized in 
the context of more than one MHC, such as "promiscuous" or "clustered" MHC binding regions, 
may permit the development of vaccines that effectively protect genetically diverse human 
populations. 

HLA typing. At the time of PBMC isolation, a small sample of cells from sonie subjects 
was suspended in cell freezing medium (Sigma) and stored m liquid nitrogen. Using supemataat 
from the immortalized B95.8 cell line (ATCC, Rockville, MD), EBV-transformed B cell lines 
were generated from thawed PBMG (on occasion, fresh PBMC were used in this step). Cell lines 
were sent to the Rhode Island Blood Center, Providence, RI, where HLA-DR typing by the 
polymerase chain reaction (PCR) technique w^as performed for 18 of the 40 Mtb immunes, nine 
from the Providence cohort, and nine from the PhilHpine cohort. The HLA-DR type of each 
subject is listed in Table 4. Only DR typing was performed, as most published motifs included in 
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EpiMer ML059S motif lists belonged to the DR subtypes. Twenty-two of the subjepts were not 
BDLA-DR typed owing to insufi&cient cells available after freezing. 

Statistical analysis. All statistical analyses were performed in Microsoft Excel v4.0 or 
v5.0 (Microsoft Corporation, Redmond, WA) using a Macintosh Quadra 650 or a 
PowerMacintosh 7100 (Apple Computer, Inc., Cupertino, CA). The relative risk (RR) of 
response and the 95 % confidence intervals (CI) for this ratio were calculated using an Excel v4.0 
spreadsheet. 

Results- HLA typing. For the 18 HLA-DR typed subjects, there was an increased 
likelihood of observing a proliferative response to a peptide which contained a motif matched for 
the subject's allele, compared to cases where the peptide did not contain a motif matched for the 
subjects allele. When stratified by subject, and then pooled by the Mantel-Haenzel method, RR = 
1.5 (95% CI 0.9 to 2.5). Some of the discordant positive responses (46 cases) may have been due 
to presentation by a DP or DQ allele that was not represented in the EpiMer ML 0595 database. 
Some of the discordant negative responses (120 cases) may have been due to inaccurate motifs, 
inhibition of peptide binding by non-anchor residues, absence of T cells recognizing that particular 
peptide, insensitivity of the assay system, or the method of analysis. 

EXAMPLE 4: EPIMER ALGORITHM 

This study provides an in vitro assessment of EpiMer predictions for Mycobacterium 
tuberculosis (Mtb) vaccine candidate peptides, EpiMer, and other MHC-binding motif-based 
algorithms, may be usefijil methods for identifying "promiscuous" peptides, which can be 
recognized by a higher number of individuals in outbred human populations. The cost and time 
savings of this method over the traditional overlapping approach are substantial, and this method 
may eventually contribute to the development of a novel sub-unit vaccine against Mtb. 

The EpiMer algorithm is designed to identify peptides that have the ability to bind to 
multiple MHC alleles. Peptides with this property have been described as 'promiscuous* or 
'universal' epitopes. In this study, the number of study subjects who had a significant proliferative 
response to the peptides was associated with the number of MHC-binding motifs contained within 
the predicted peptide (RR = 5.0, 95% CI 1,7 to 14). However, the association between number of 
MHC-binding motifs contained witiiin a peptide and response to the peptide was not absolute, as 
demonstrated by several peptides which contained a large number of MHC-binding motifs but 
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stimulated in vitro response in only a few of the study subjects, and vice versa (32-3 and 32-9 
respectively). 

One reason for this may be that peptides which contain multiple anchor based binding 
motifs may also contain amino acids that have other features (such as bulky or chjurged side 
chains, or cleavage sites) which inhibit the peptides from bindii^ to certain MHC molecules 
(Boehncke, etal, J Immunol 150:331 (1993); Ruppert, etal, Cell 74:205 (1993)). In contrast, 
peptides that contained no motif matches according to our ML 0595 list may indeed contain 
MHC-binding motifs or ligands that have yet to be described or included. 

At the time of epitope prediction for the studies described, the MHC-binding motif 
database consisted of a total of 15 distinct human motife. Later, we foimd that 26 human class II 
MHC-binding motifs are utilized by EpiMer ML 0595 (Table 3). Some motifs that were used by 
EpiMer at the time of epitope prediction have since been shown to be inaccurate predictors of 
MHC-binding, and as such, are no longer included for use by the EpiMer algorithm. As more 
MHC-binding motifs are identified, and existing motifs are refined through further study, the 
algorithm's predictive capacity is expected to improve. 

For many of the subjects in the Mtb immune group, the date of PPD skin-test conversion 
was not known. This leaves open the possibility that some of these subjects might have been 
exposed to Mtb many years prior to the collection of their PBMC, while others may have been 
exposed more recently, resulting in a range of immune responses in our subject cohort as 
measured in the T cell proliferation assay. Within the Mtb naive control group, four of the Mtb 
naifve subjects showed proliferative responses to five or more peptides as well as an vitro 
response to PPD. These responses could be due to subclinical (and PPD skin-test negative) 
infection with Mtb, or to exposure to environmental mycobacteria, leading to cross-reactive 
proliferative responses to shared antigens (in the case of PPD) or to shared regions of protein 
sequences (in the case of responses to specific peptides) (Stanford, et al^ J Hyg Lond 76:205 
(1976)). Furthermore, the 14 kDa protein has been shown to be unique to Mtb; therefore 
proliferative responses seen in five Mtb naive individuals to peptides derived firom this antigen are 
difficuh to explain, unless subclinical exposure had occurred, or the particular peptide used in this 
assay is similar to T cell epitopes derived firom other antigenic proteins. Until better tests can be 
developed to confirm latent Mtb infection, it is difficult to determine how to classify PPD skin test 
positive individuals who have no known date of exposure to Mtb infection and few in vitro 
responses to Mtb antigens. 
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Similarly, non-responsiveness to the tetanus toxoid antigen can probably be explained by 
the fact that immunity derived by immunization with TT is not life-long, and that such acquired 
immunity wanes without frequent boosting (Gergen, et aL^ New Eng J Med 332:761 (1995)), 

OTHER EMBODIMENTS 

The details of one or more embodiments of the invention are set forth in the accompanying 
description above. Although any methods and materials similar or equivalent to those described 
herein can be used in the practice or testing of the present invention, the preferred methods and 
materials have been described. Other features, objects, and advantages of the invention will be 
apparent from the description and from the claims. Unless defined otherwise, all technical and 
scientific terms used herein have the same meaning as commonly understood by one of ordinary 
skill in the art to which this invention belongs. All patents and publications cited in this 
specification are incorporated by reference. 

The foregoing description has been presented only for the purposes of illustration and is 
not intended to limit the invention to the precise form disclosed, but only to the claims appended 
hereto. 
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CLAIMS 

WE CLAIM: 

1 . A vaccine comprising: 

2, Mycobacterium tuberculosis (Mtb) vaccine candidate peptide containing an 
amino acid sequence selected from the group consisting of SEQ ID NOs:l-28, in 
an immunologically acceptable excipient. 

2. The vaccine of claim 1, wherein the peptide is between 8 amino acids and 50 amino acids 
in length. 

3 . The vaccine of claim 1 , wherein the Mtb vaccine candidate peptide has an amino acid 
sequence selected from the group consisting of SEQ ID NOs: 1-28. 

4. The vaccine of claim 1, wherein the peptide is complexed to a carrier protein. 

5. The vaccine of claim 1, wherein the peptide is a recombinant fusion protein. 

6. The vaccine of claim 1, wherein the excipient is an adjuvant. 

7. A recombinant Mycobacterium tuberculosis (Mtb) vaccine candidate peptide, comprising: 

a peptide contmning an amino acid sequence selected from the group consisting of 
SEQ ID NOs: 1-28, wherein the peptide is expressed from a recombinant 
polynucleotide. 

8 . The recombinant peptide of claim 1, wherein the recombinant polynucleotide is a naked 
DNA vaccine. 

9. A method for inducing an mixrMycobacterium tuberculosis (anti-Mtb) immune response, 
comprising: 

administering to a mammalian subject a Mtb vaccine candidate peptide containing 
an amino acid sequence selected from the consisting of SEQ ID NOs: 1-28. 
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10. The method of claim 9, wherein the induction of an anti-Mtb immune response is the 
raising of an anti-Mtb antibody. 

11. The method of claim 9, wherein the mammalian subject is a human. 

12. The method of claim 9, wherein the administration is selected from the group consisting of 
orally, topically, parenterally, by viral infection, and intravascularly. 
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Fig, 3 



X Variable 1 Line Fit Plot 
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